Optimize tree route to sync faster #3665

marcio-diaz · 2019-09-22T08:39:01Z

Summary

This PR removes some inefficiencies in tree_route and the functions that call it. Previously, tree_route computed the path between two blocks by loading headers from the database. Now the information it needs is kept in an in-memory LRU cache.
Also, we often used tree_route just to check whether a block is a descendant of another one. Instead, this PR introduces a function lowest_common_ancestor that answers such descendant checks in O(1) (most of the time).
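
To make the route computation concrete, here is a minimal sketch of walking a tree route over slim cached records instead of full headers. It is not the code in this PR: the `Meta` and `TreeRoute` types, the `u64` hashes, and the plain `HashMap` standing in for the cache are all assumptions made for illustration.

```rust
use std::collections::HashMap;

/// Hypothetical slim record: just enough of a header to walk the tree.
#[derive(Clone, Copy, Debug, PartialEq, Eq)]
pub struct Meta {
    pub hash: u64,
    pub number: u64,
    pub parent: u64,
}

/// Blocks to retract and enact when moving from one block to another.
#[derive(Debug, PartialEq, Eq)]
pub struct TreeRoute {
    /// From `from` down to, but excluding, the common block.
    pub retracted: Vec<u64>,
    pub common: u64,
    /// From just above the common block up to `to`.
    pub enacted: Vec<u64>,
}

/// Returns `None` if a needed record is missing from the (assumed complete) map.
pub fn tree_route(cache: &HashMap<u64, Meta>, from: u64, to: u64) -> Option<TreeRoute> {
    let mut f = *cache.get(&from)?;
    let mut t = *cache.get(&to)?;
    let (mut retracted, mut enacted) = (Vec::new(), Vec::new());

    // Bring both cursors to the same height.
    while t.number > f.number {
        enacted.push(t.hash);
        t = *cache.get(&t.parent)?;
    }
    while f.number > t.number {
        retracted.push(f.hash);
        f = *cache.get(&f.parent)?;
    }
    // Walk both sides up in lockstep until they meet.
    while f.hash != t.hash {
        retracted.push(f.hash);
        enacted.push(t.hash);
        f = *cache.get(&f.parent)?;
        t = *cache.get(&t.parent)?;
    }
    enacted.reverse();
    Some(TreeRoute { retracted, common: f.hash, enacted })
}
```

Each step only touches a hash, a number and a parent hash, which is why keeping such records in memory avoids decoding whole headers from the database.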

As a side effect, this improves syncing speed for Kusama from ~2 hours to ~20 minutes (25 bps -> 35 bps in wasm, 80 bps -> 400 bps in native).

Changes:

Introduce a CachedHeaderMetadata with fields hash, number, parent and ancestor. The ancestor field is used to quickly jump through the tree.
We cache the last 20_000 HeaderMetadata entries used, by means of an LRU cache. The optimal number depends on the difference between the best and finalized block numbers.
Use CachedHeaderMetadata to traverse the tree in the tree_route function. Previously we were loading the whole headers from DB to do this task.
Implement a lowest_common_ancestor function and use it in is_descendent_of and other places, replacing tree_route. Right now, the query pattern of this function is as follows: lca(best, final), lca(best+1, final), lca(best+2, final), ..., lca(best+5000, final). By using the ancestor field of LightHeader, the first query is O(n) and the following ones are O(1).
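
The effect of the ancestor field on that query pattern can be sketched as follows. This is an illustrative sketch under stated assumptions, not the PR's implementation: hashes are plain `u64`, a `HashMap` named `Store` stands in for the LRU cache plus database, and `insert_block` is a hypothetical helper. The ancestor link starts out equal to the parent and is moved to the answer after every query.

```rust
use std::collections::HashMap;

#[derive(Clone, Debug, PartialEq, Eq)]
pub struct CachedHeaderMetadata {
    pub hash: u64,
    pub number: u64,
    pub parent: u64,
    /// Some ancestor at or above the parent; used as a shortcut.
    pub ancestor: u64,
}

/// Stand-in for the cache (assumed to hold every block we touch).
pub type Store = HashMap<u64, CachedHeaderMetadata>;

/// Hypothetical helper: a new block's shortcut initially points at its parent.
pub fn insert_block(store: &mut Store, hash: u64, number: u64, parent: u64) {
    store.insert(hash, CachedHeaderMetadata { hash, number, parent, ancestor: parent });
}

pub fn lowest_common_ancestor(store: &mut Store, one: u64, two: u64) -> Option<u64> {
    let mut a = store.get(&one)?.clone();
    let mut b = store.get(&two)?.clone();
    let (number_one, number_two) = (a.number, b.number);

    // Follow ancestor links while they do not overshoot the other block's height.
    while a.number > b.number {
        let up = store.get(&a.ancestor)?.clone();
        if up.number >= b.number { a = up; } else { break; }
    }
    while b.number > a.number {
        let up = store.get(&b.ancestor)?.clone();
        if up.number >= a.number { b = up; } else { break; }
    }
    // Cover the remaining distance with parent links.
    while a.hash != b.hash {
        if a.number > b.number {
            a = store.get(&a.parent)?.clone();
        } else {
            b = store.get(&b.parent)?.clone();
        }
    }
    // Remember the result as a shortcut for later queries.
    if number_one > a.number {
        store.get_mut(&one)?.ancestor = a.hash;
    }
    if number_two > a.number {
        store.get_mut(&two)?.ancestor = a.hash;
    }
    Some(a.hash)
}

/// A block is not considered a descendant of itself in this sketch.
pub fn is_descendent_of(store: &mut Store, base: u64, block: u64) -> Option<bool> {
    if base == block {
        return Some(false);
    }
    Some(lowest_common_ancestor(store, block, base)? == base)
}
```

With this shape, lca(best, final) walks the whole gap once and stores final as the shortcut of best. The next query, lca(best+1, final), hops from best+1 to best and then straight to final.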

Notes:

I also tried implementing lowest_common_ancestor by dividing the tree into sections and jumping between them, getting O(sqrt(h)) per query, but it didn't work very well in practice.

core/client/src/blockchain.rs

rphmeier · 2019-09-23T10:00:11Z

I'm curious as to what other approaches you have tried. It seems much more memory and computation light to cache the results of queries to is_descendent_of themselves. My thesis:

is_descendent_of queries will be common for short chains near the HEAD
queries deeper into the chain will tend to focus on some very specific blocks

The memory cache approach is clearly faster than doing nothing, but it requires keeping a big LRU cache around.
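
For comparison, caching the query results themselves might look roughly like the sketch below. Everything here is an assumption for illustration: the `QueryCache` type, the `u64` hashes, and the `slow_path` closure that stands in for the real ancestry check. Ancestry between two fixed hashes never changes, so a cached answer stays valid.

```rust
use std::collections::HashMap;

/// Tiny LRU map keyed by `(base, block)`. Eviction scans for the least
/// recently used entry, which is acceptable only for small capacities.
pub struct QueryCache {
    capacity: usize,
    tick: u64,
    entries: HashMap<(u64, u64), (bool, u64)>,
    /// Number of times the slow path had to run.
    pub misses: u64,
}

impl QueryCache {
    pub fn new(capacity: usize) -> Self {
        assert!(capacity > 0, "capacity must be non-zero");
        QueryCache { capacity, tick: 0, entries: HashMap::new(), misses: 0 }
    }

    /// Answers from the cache when possible, otherwise runs `slow_path`.
    pub fn is_descendent_of<F>(&mut self, base: u64, block: u64, slow_path: F) -> bool
    where
        F: FnOnce(u64, u64) -> bool,
    {
        self.tick += 1;
        if let Some(entry) = self.entries.get_mut(&(base, block)) {
            entry.1 = self.tick; // mark as recently used
            return entry.0;
        }
        self.misses += 1;
        let answer = slow_path(base, block);
        if self.entries.len() >= self.capacity {
            // Evict the entry with the oldest use tick.
            let oldest = self
                .entries
                .iter()
                .min_by_key(|(_, entry)| entry.1)
                .map(|(key, _)| *key);
            if let Some(key) = oldest {
                self.entries.remove(&key);
            }
        }
        self.entries.insert((base, block), (answer, self.tick));
        answer
    }
}
```

This only helps when the same (base, block) pairs are asked repeatedly, so it would not by itself speed up a pattern where the block changes on every query.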

marcio-diaz · 2019-09-23T10:36:46Z

> I'm curious as to what other approaches you have tried. It seems much more memory and computation light to cache the results of queries to is_descendent_of themselves. My thesis:
>
> * `is_descendent_of` queries will be common for short chains near the HEAD
> * queries deeper into the chain will tend to focus on some very specific blocks
>
> The memory cache approach is clearly faster than doing nothing, but it requires keeping around a big LRU cache

It seems we are making many queries from the imported block to the finalized block. As I noted in the description, the cache is there to cover this window. While syncing Kusama I have seen maximum differences of about 10k blocks, so we can probably reduce the cache size from 20k to 10k (or even further) and keep similar performance. I also thought about disabling it after the main sync.

core/client/db/src/lib.rs

core/client/src/blockchain.rs

arkpar · 2019-09-24T11:16:05Z

> Right now, the query pattern of this function is as follows: lca(best, final), lca(best+1, final), lca(best+2, final), ..., lca(best+5000, final)

I wonder where this pattern is coming from. I imagine we only need to check tree paths or ancestry w.r.t. the final block when finalizing, and not when importing or checking each block.

Does GRANDPA really need to check if each new block is descendent from the last finalized block?
@andresilva @rphmeier

core/client/src/blockchain.rs

rphmeier · 2019-09-24T13:40:14Z

> Does GRANDPA really need to check if each new block is descendent from the last finalized block?
> @andresilva @rphmeier

With any kind of finality, we always have to do this check. At the very least, if that block could be a new "best" block.

andresilva · 2019-09-24T16:10:17Z

Some explanation of where tree_routes are being calculated regarding consensus:

GRANDPA - we track pending authority set changes in a tree, we calculate tree_routes when making queries on this tree, either when importing a block if it includes a consensus change digest, or when finalizing a block to check if finalizing the given block finalizes anything in the tree.
BABE - epoch changes are announced one epoch in advance and we also track them in a tree. Whenever we import a block that signals an epoch change we must import it into the tree, and we also query for the epoch that just started (previously announced). Some changes introduced in Fixing BABE epochs to change between blocks #3583 changed some of this logic, but I think it should now cost less, as we minimize the number of calls to is_descendent_of when querying the tree for pending epochs.

Nodes in the tree have an implicit ancestry relationship that is given by is_descendent_of (which uses tree_route internally).
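
As a rough illustration of why such a tree leans on ancestry queries, here is a minimal sketch of importing into a tree whose structure is derived only from an is_descendent_of callback. The `PendingTree` and `Node` types and the `u64` hashes are hypothetical, and the sketch assumes ancestors are imported before their descendants. It is not the tree used by GRANDPA or BABE.

```rust
/// Hypothetical node of a tree of pending changes.
#[derive(Debug, PartialEq, Eq)]
pub struct Node {
    pub hash: u64,
    pub children: Vec<Node>,
}

#[derive(Debug, Default)]
pub struct PendingTree {
    pub roots: Vec<Node>,
}

impl PendingTree {
    /// Places `hash` under the deepest existing node it descends from.
    /// `is_descendent_of(base, block)` is the only source of ancestry information.
    pub fn import<F>(&mut self, hash: u64, is_descendent_of: &F)
    where
        F: Fn(u64, u64) -> bool,
    {
        let node = Node { hash, children: Vec::new() };
        insert_under(&mut self.roots, node, is_descendent_of);
    }
}

fn insert_under<F>(siblings: &mut Vec<Node>, node: Node, is_descendent_of: &F)
where
    F: Fn(u64, u64) -> bool,
{
    for sibling in siblings.iter_mut() {
        // One ancestry query per visited node on the way down.
        if is_descendent_of(sibling.hash, node.hash) {
            return insert_under(&mut sibling.children, node, is_descendent_of);
        }
    }
    // No sibling is an ancestor, so the node starts a new branch here.
    siblings.push(node);
}
```

Every import costs at least one ancestry query per tree level it passes through, so a slow is_descendent_of is paid many times over during sync.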

I doubt that on Kusama the issue is being caused by GRANDPA since there are no authority changes happening. Most likely it was being caused by BABE (although that should have been improved by #3583).

Regardless of any improvements on BABE, having a way to speed up common ancestry queries makes sense to me.

rphmeier · 2019-09-24T16:36:55Z

In #3586 we do is_descendent_of calls for every block, since I removed the caching that it previously used to figure out the epoch for a block. I think a really small LRU cache to track which blocks we recently queried the epoch for (so we can see if the new block would be in the same epoch as its parent) would kill 95% of those queries.

core/authority-discovery/src/lib.rs

Demi-Marie

LGTM although I am no expert on the database.

marcio-diaz · 2019-09-27T08:30:25Z

This is ready for another look, I introduced the crate and traits as @rphmeier suggested (although I'm not sure about the location and name).

core/client/header-metadata/src/lib.rs

andresilva

overall lgtm, minor nits.

core/client/header-metadata/src/lib.rs

core/client/db/src/lib.rs

core/client/src/in_mem.rs

core/client/src/light/blockchain.rs

marcio-diaz requested review from Demi-Marie and andresilva as code owners September 22, 2019 08:39

marcio-diaz added the A0-please_review Pull request needs code review. label Sep 22, 2019

marcio-diaz requested a review from arkpar September 22, 2019 08:43

marcio-diaz changed the title ~~Optimize tree route~~ Optimize tree route to sync faster Sep 23, 2019

rphmeier reviewed Sep 23, 2019

core/client/src/blockchain.rs (outdated, resolved)

rphmeier reviewed Sep 23, 2019

core/client/db/src/lib.rs (outdated, resolved)

rphmeier reviewed Sep 23, 2019

core/client/src/blockchain.rs (outdated, resolved)

arkpar reviewed Sep 24, 2019

core/client/src/blockchain.rs (outdated, resolved)

Demi-Marie reviewed Sep 24, 2019

core/authority-discovery/src/lib.rs (outdated, resolved)

core/authority-discovery/src/lib.rs (outdated, resolved)

Demi-Marie reviewed Sep 24, 2019

marcio-diaz added A3-in_progress Pull request is in progress. No review needed at this stage. and removed A0-please_review Pull request needs code review. labels Sep 26, 2019

marcio-diaz force-pushed the marcio/optimize-tree-route branch from 11af594 to 03b1c77 Compare September 27, 2019 08:25

marcio-diaz added A0-please_review Pull request needs code review. and removed A3-in_progress Pull request is in progress. No review needed at this stage. labels Sep 27, 2019

marcio-diaz force-pushed the marcio/optimize-tree-route branch from c5d05e1 to 5d8eb9f Compare September 27, 2019 12:55

marcio-diaz requested review from Demi-Marie, arkpar and rphmeier September 27, 2019 14:53

rphmeier reviewed Sep 30, 2019

core/client/header-metadata/src/lib.rs (resolved)

rphmeier reviewed Sep 30, 2019

core/client/header-metadata/src/lib.rs (resolved)

andresilva reviewed Sep 30, 2019

core/client/header-metadata/src/lib.rs (outdated, resolved)

core/client/db/src/lib.rs (outdated, resolved)

core/client/src/in_mem.rs (outdated, resolved)

core/client/src/light/blockchain.rs (outdated, resolved)

seerscode added 21 commits October 2, 2019 18:25

Use lowest_common_ancestor in informant.

1667fbf

Use lowest_common_ancestor and new tree_route in Client.

89a20f1

Use get_light_header in in_mem.

9933541

Use lowest_common_ancestor in is_descendent_of.

6d13ab2

Use light header in ancestry.

3004c17

Forward set/get light_header for Blockchain.

1d5645a

Add lru crate.

0bd2228

Add Cargo.lock.

303f2d5

Update cargo.lock

7afb9fd

Fix compilation.

3dcf173

Fix test compilation.

dc6ebb8

Fix cargo.lock

69c846c

Extract header metadata to own crate.

df7be28

Fix tests and remove unused code.

96c10cc

Adds some docs.

c789299

Fix cargo.lock

b8b976a

merge master

004b48a

Fix nits.

641755f

Use same lru cache everywhere.

5903fb5

Remove TreeBackend.

f0ecf0e

Add a couple of test cases more.

cbc5840

marcio-diaz force-pushed the marcio/optimize-tree-route branch from 6f2e7d8 to 8cb5b42 Compare October 2, 2019 17:14

seerscode added 3 commits October 2, 2019 19:14

Remove functions from import lines.

8cb5b42

Fix tests.

5621af9

Fix indent.

a993063

marcio-diaz added A8-looksgood and removed A0-please_review Pull request needs code review. labels Oct 2, 2019

marcio-diaz merged commit d7be290 into master Oct 2, 2019

marcio-diaz deleted the marcio/optimize-tree-route branch October 2, 2019 18:30

andresilva mentioned this pull request Oct 7, 2019

client: fix comparison of CachedHeaderMetadata in tree_route #3776

Merged
